Probabilistic record linkage and a method to calculate the positive predictive value.

نویسندگان

  • Tony Blakely
  • Clare Salmond
چکیده

BACKGROUND Computerized record linkage is commonly used in cohort studies to ascertain the study outcome, and as such its accuracy classifying the outcome can be described using the standard epidemiological terms of sensitivity and positive predictive value (PPV). METHOD We describe a 'duplicate method' to calculate the PPV of record linkage when each record can only be involved in one match (e.g. linking population files to death files). The method does not require a validation subset of records from both files with detailed personal information (e.g. name and address), and is therefore ideal for linkage projects using anonymous data. The duplicate method assumes that the number of records from one file with zero, one, two, etc., links from the other file is distributed in a manner predicted by combinatorial probabilities. Having made this assumption, the number of false positive links, and hence the PPV, are estimable. We demonstrate this duplicate method using output from anonymous and probabilistic record linkage of census and mortality records in New Zealand. RESULTS The PPV estimates conform to the pattern expected based on the underlying theory of probabilistic record linkage, and were robust to sensitivity analyses. We encourage other researchers to further assess the accuracy of this method.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Probabilistic Linkage of Persian Record with Missing Data

Extended Abstract. When the comprehensive information about a topic is scattered among two or more data sets, using only one of those data sets would lead to information loss available in other data sets. Hence, it is necessary to integrate scattered information to a comprehensive unique data set. On the other hand, sometimes we are interested in recognition of duplications in a data set. The i...

متن کامل

[Accuracy of the probabilistic record linkage methodology to ascertain deaths in survival studies].

Probabilistic record linkage methodology has been increasingly used to ascertain outcomes in cohort studies. However, only a few studies have evaluated its accuracy. The aim of this study was to evaluate the accuracy of probabilistic record linkage methodology to ascertain deaths in a cohort of 250 elderly people hospitalized for fractures caused by falls. The vital status of cohort members was...

متن کامل

Accuracy of probabilistic record linkage applied to health databases: systematic review.

OBJECTIVE To analyze both national and international literature on validity of record linkage procedure of health databases focusing on quality assessment of results. METHODS A systematic review of cohort, case-control, and cross-sectional studies that evaluated quality of probabilistic record linkage of health databases was conducted. Cochrane methodology of systematic reviews was used. The ...

متن کامل

Practical introduction to record linkage for injury research.

The frequency of early fatality and the transient nature of emergency medical care mean that a single database will rarely suffice for population based injury research. Linking records from multiple data sources is therefore a promising method for injury surveillance or trauma system evaluation. The purpose of this article is to review the historical development of record linkage, provide a bas...

متن کامل

Accuracy of Probabilistic Linkage Using the Enhanced Matching System for Public Health and Epidemiological Studies

BACKGROUND The Enhanced Matching System (EMS) is a probabilistic record linkage program developed by the tuberculosis section at Public Health England to match data for individuals across two datasets. This paper outlines how EMS works and investigates its accuracy for linkage across public health datasets. METHODS EMS is a configurable Microsoft SQL Server database program. To examine the ac...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • International journal of epidemiology

دوره 31 6  شماره 

صفحات  -

تاریخ انتشار 2002